Scene Parsing by Data Driven Cluster Sampling

نویسندگان

  • Quan Zhou
  • Tianfu Wu
  • Wen-yu Liu
  • Song-Chun Zhu
چکیده

This paper presents a data-driven cluster sampling framework for parsing scene images into generic regions (such as the sky, mountain and water) and objects (such as cows, horses and cars). We adopt generative models for both generic regions and objects, thus their likelihood probabilities are comparable and are learned under a common information projection principle. The inference algorithm follows the data-driven Markov Chain Monte Carlo (DDMCMC) paradigm where the object and generic region models cooperate and compete for an optimal interpretation of the scene in a Bayesian framework. The algorithm has two phases: (i) Bottom-up computation for generating data-driven proposals. There are two types of proposals: proposals for regular-shape objects using the active basis models and proposals for both generic regions and irregularshape objects (such as crouching cows) by training a set of discriminative models on the appearance. A candidacy graph is constructed to summarize all the bottomup information by treating proposals as nodes and cooperative/competitive contextual relations among proposals as +/edges. (ii) Top-down computation by cluster sampling for seeking the optimal solution that maximizes the Bayesian posterior probability. The cluster sampling algorithm consists of reversible jumps to explore the solution space effectively. At each step, it samples the +/edge probabilities on the candidacy graph and divides the candidacy graph into a set of composQ. Zhou†,∗, T.F. Wu‡,∗, W.-Y. Liu† and S.-C. Zhu‡,?,∗ Department of †Electronic and Information Engineering, Hua Zhong University of Science and Technology, Wuhan, China Department of ‡Statistics and ?Computer Science, University of California, Los Angeles, USA ∗Lotus Hill Research Institute (LHI), Ezhou, China E-mail: [email protected], {tfwu, sczhu}@stat.ucla.edu, [email protected] ite connected-components (CCCP’s) on which the reversible jumps are carried out. In experiments, our algorithm outperforms the state-of-the-art methods on the LHI 15-class dataset and obtains comparable results on the MSRC 21-class dataset (the LHI 15-class dataset has more accurate annotations and we release it with this publication).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تأثیر ساخت‌واژه‌ها در تجزیه وابستگی زبان فارسی

Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...

متن کامل

Scene Parsing Using Scene Attributes As Global Features

Data-driven methods have been proven very effective for the task of scene parsing. A crucial step in these methods is to retrieve a set of visually similar scenes from existing image collections for the query image according to certain global scene representations. In this work, we incorporate scene attributes into data-driven scene parsing systems as global scene features. We show that when us...

متن کامل

Image Parsing via Stochastic Scene Grammar

This paper proposes a parsing algorithm for scene understanding which includes four aspects: computing 3D scene layout, detecting 3D objects (e.g. furniture), detecting 2D faces (windows, doors etc.), and segmenting background. In contrast to previous scene labeling work that applied discriminative classifiers to pixels (or super-pixels), we use a generative Stochastic Scene Grammar (SSG). This...

متن کامل

On the Influence of Superpixel Methods for Image Parsing

Image parsing describes a very fine grained analysis of natural scene images, where each pixel is assigned a label describing the object or part of the scene it belongs to. This analysis is a keystone to a wide range of applications that could benefit from detailed scene understanding, such as keyword based image search, sentence based image or video descriptions and even autonomous cars or rob...

متن کامل

Scene Parsing and Fusion-Based Continuous Traversable Region Formation

Determining the categories of different parts of a scene and generating a continuous traversable region map in the physical coordinate system are crucial for autonomous vehicle navigation. This paper presents our efforts in these two aspects for an autonomous vehicle operating in open terrain environment. Driven by the ideas that have been proposed in our Cognitive Architecture, we have designe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011